Cystic fibrosis transmembrane conductance regulator gene mutations

ABSTRACT

The present invention provides novel mutations of the CFTR gene related to cystic fibrosis or to conditions associated with cystic fibrosis. Also provided are probes for detecting the mutant sequences. Methods of identifying if an individual has a genotype containing one or more mutations in the CFTR gene are further provided.

FIELD OF THE INVENTION

The present invention relates to novel cystic fibrosis transmembrane regulator (CFTR) gene mutations and to methods for detecting the presence of these mutations in individuals.

BACKGROUND OF THE INVENTION

The following description of the background of the invention is provided simply as an aid in understanding the invention and is not admitted to describe or constitute prior art to the invention.

Cystic fibrosis (CF) is the most common severe autosomal recessive genetic disorder in the Caucasian population. It affects approximately 1 in 2,500 live births in North America (Boat et al., The Metabolic Basis of Inherited Disease, 6^(th) ed., pp 2649-2680, McGraw Hill, NY (1989)). Approximately 1 in 25 persons of northern European Caucasian descent are carriers of the disease. The responsible gene has been localized to a 250,000 base pair genomic sequence present on the long arm of chromosome 7. This sequence encodes a membrane-associated protein called the “cystic fibrosis transmembrane regulator” (or “CFTR”). There are greater than 1000 different mutations in the CFTR gene, each having varying frequencies of occurrence in different populations, presently reported to the Cystic Fibrosis Genetic Analysis Consortium. These mutations exist in both the coding regions (e.g., ΔF508, a mutation found on about 70% of CF alleles, represents a deletion of a phenylalanine at residue 508) and the non-coding regions (e.g., the 5T, 7T, and 9T variants correspond to a sequence of 5, 7, or 9 thymidine bases located at the splice branch/acceptor site of intron 8) of the CFTR gene.

The major symptoms of cystic fibrosis include chronic pulmonary disease, pancreatic exocrine insufficiency, and elevated sweat electrolyte levels. The symptoms are consistent with cystic fibrosis being an exocrine disorder. Although recent advances have been made in the analysis of ion transport across the apical membrane of the epithelium of CF patient cells, it is not clear that the abnormal regulation of chloride channels represents the primary defect in the disease.

A variety of CFTR gene mutations are known. The identification of additional mutations will further assist in the diagnosis of cystic fibrosis.

SUMMARY OF THE INVENTION

The inventors have discovered new mutations in the CFTR gene. These mutations, include 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G, are related to the function of the CFTR gene and, therefore, to cystic fibrosis. These mutations are associated with cystic fibrosis or are associated with conditions associated with cystic fibrosis. By “conditions associated with cystic fibrosis” is meant any clinical symptoms that may be found in a cystic fibrosis patient and are due to one or more CF mutations.

Accordingly, in one aspect, the present invention provides a method of determining if a CFTR gene contains one or more mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G, comprising determining whether CFTR nucleic acid contains one or more of said mutations.

In another aspect, the present invention provides a method of identifying if an individual has one or more mutations in the CFTR gene comprising determining if nucleic acid from the individual has one or more mutations in one or both CFTR genes, the mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G.

In yet another aspect, the present invention provides a method of determining if an individual is predisposed to cystic fibrosis or to a condition associated with cystic fibrosis comprising determining if nucleic acid from the individual has one or more mutations in one or both CFTR genes, the mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375-2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G.

In still a further aspect, the present invention provides a method of counseling an individual on the likelihood of having an offspring afflicted with cystic fibrosis or a condition associated with cystic fibrosis, comprising determining if nucleic acid from the individual has one or more mutations in one or both CFTR genes, the mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G.

In some embodiments, the mutations are selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500 is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), and 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted). In other embodiments the mutations are selected from the group consisting of 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, and −141C>A.

In some embodiments, one or more mutations are evaluated for both alleles of the CFTR gene in the individual. By this approach the genotype of the individual can be determined at the position of each mutation.

The presence of the mutation in the CFTR gene may be determined by any of a variety of well known methods used to detect single base changes (transitions, transversions, and/or small deletions/insertions). Thus, genomic DNA may be isolated from the individual and tested for the CF mutations. In another approach, mRNA can be isolated and tested for the CF mutations. Testing may be performed on mRNA or on a cDNA copy.

Genomic DNA or cDNA may be subject to amplification by the polymerase chain reaction or related methods using primers directed to specific portions of the CFTR gene which contain a mutation to be detected. The sequences of primers suitable for PCR amplification of portions of the CFTR gene in which contain the CF mutations are also provided.

The presence CF mutations can be determined in a nucleic acid by sequencing appropriate portions of the CFTR gene containing the mutations sought to be detected. For example, each amplicon of the CFTR gene is sequenced with both M13 forward and reverse primers. In another approach, CF mutations that change susceptibility to digestion by one or more endonuclease restriction enzymes may be used to detect the mutations. In another embodiment, the presence of one or more CF mutations can be determined by allele specific amplification. In yet another embodiment, the presence of one or more CF mutations can be determined by primer extension. In yet a further embodiment, the presence of one or more CF mutations can be determined by oligonucleotide ligation. In another embodiment, the presence of one or more CF mutations can be determined by hybridization with a detectably labeled probe containing the mutant CF sequence.

According to the invention, the presence of CF mutations can also be determined by analyzing the CF protein encoded by the mutated CF gene. The mutations include, for example, E1104V, deletion of L997, G646X, deletion of Y1014, D924H, 11328T, K1351N, L130V, Q237P, A1466S, 1106V, 1148F, M 1191T, Y122C, V915L, 1853V, A252T, R1438Q or frameshift mutations.

Detection of CF mutations at the protein level can be detected by any method well known in the field. In one embodiment, detection of CF mutations is carried out by isolating CF protein and subjecting it to amino acid sequence determination. This may require fragmenting the protein by proteolytic or chemical means prior to sequencing. Method of determining an amino acid sequence are well known in the art.

In other embodiments, the presence of CFTR mutations is determined using antibodies that bind specifically to a mutant CFTR protein sequence. For example, ELISA or other immunological assays known to a person skilled in the art can be used to detect CFTR mutations using specific antibodies for each mutation. Method of producing antibodies to specific sequence of a protein such as a mutation containing sequence are well known. For example, one may immunize an animal with the mutant CFTR protein or with peptide fragments of the mutant protein containing the mutant sequence. If monoclonal antibodies are produced, those specific for the mutant sequence can be obtained by screening the antibodies for differential reactivity between the mutant CFTR protein and wildtype CFTR protein. If a mutation specific polyclonal antisera is desired, one may process the initial antisera by removing antibodies reactive with the wildtype CFTR protein. Optionally, such antisera may be concentrated by affinity chromatography using the mutant CFTR protein. Further steps to remove wild-type CFTR reactivity may be conducted.

Methods for developing monoclonal and polyclonal antibodies to defined epitopes of the CFTR protein have been previously described. See, e.g., U.S. Pat. No. 5,981,714 (Cheng et al.,); Cohn et al., Biochem Biophys Res Commun. 1991 Nov. 27; 181(1): 36-43; Walker et al, J Cell Sci. 1995 June; 108 (Pt 6): 2433-44; Klass et al. J Histochem Cytochem. 2000 June; 48(6):831-7; Doucet et al., J Histochem Cytochem. 2003 September; 51(9): 1191-9; Carvelho-Oliveira et al. J Histochem Cytochem. 2004 February; 52(2): 193-203; and Mendes et al. J Cyst Fibros. 2004 August; 3 Suppl 2:69-72.

The methods of the invention also may include detection of other CF mutations which are known in the art and which are described herein.

The present invention also provides oligonucleotide probes that are useful for detecting the CF mutations. Accordingly, provided is a substantially purified nucleic acid comprising 8-20 nucleotides fully complementary to a segment of the CFTR gene that is fully complementary to a portion of the CFTR gene and encompasses a mutant CFTR sequence selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G, or a complementary nucleic acid sequence thereof. In one embodiment, the purified nucleic acid is no more than 50 nucleotides in length. The invention CF mutant probes may be labeled with a detectable label, which may include any of a radioisotope, a dye, a fluorescent molecule, a hapten or a biotin molecule.

In another aspect the present invention provides kits for one of the methods described herein. In various embodiments, the kits contain one or more of the following components in an amount sufficient to perform a method on at least one sample: one or more primers of the present invention, one or more devices for performing the assay, which may include one or more probes that hybridize to a mutant CF nucleic acid sequence, and optionally contain buffers, enzymes, and reagents for performing a method of detecting a genotype of cystic fibrosis in a nucleic acid sample.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a table showing various CFTR mutations and characterizing information. A normal immunoreactive trypsinogen (IRT) value is 30 μg/L or lower. An elevated IRT value is an indication of possible CF condition. The normal range for a sweat test is a chloride value less than 40 mEq/L. An elevated chloride value is an indication of possible CF condition. A normal individual has a value of greater than 480 μg/g for a stool elastase test, while a value under 100 μg/g indicates severe pancreatic insufficiency.

DETAILED DESCRIPTION OF THE INVENTION

CF mutations and exemplary PCR primer pairs for amplifying segments of the CFTR gene containing the mutations are shown in Table 1.

TABLE 1 CF mutations and associated amplification primers CF Mutation CF Mutation Forward and Reverse PCR Nucleic acid Protein Amplification Primers 3443A>T E1104V q17be1F (SEQ ID NO: 33) and q17be1R (SEQ ID NO: 34) 2443delA (A at position 2443 is frameshift q13-2e1F (SEQ ID NO: 23) and deleted) q13-2e1R (SEQ ID NO: 24) 2777insTG (TG are inserted at frameshift q14be2F (SEQ ID NO: 27) and position 2777) q14be2R (SEQ ID NO: 28) 3123-3125delGTT (GTT at deletion of q17ae1F (SEQ ID NO: 31) and positions 3123-3125 are deleted) L997 q17ae1R (SEQ ID NO: 32) 4177delG (G at position 4177 is frameshift q22e1F (SEQ ID NO: 39) and deleted) q22e1R (SEQ ID NO: 40) 630delG (G at position 630 is frameshift g5e3F (SEQ ID NO: 11) and deleted) g5e4R (SEQ ID NO: 12) 2068G>T G646X q13-1e1F (SEQ ID NO: 21) and q13-1e2R (SEQ ID NO: 22) 1342−2A>G (A in the splice splicing g9e9F (SEQ ID NO: 19) and acceptor site of intron 8, 2 g9e11R (SEQ ID NO: 20) nucleotides upstream of position 1342, is substituted with G) 297−1G>A (G in the splice splicing s3e1F (SEQ ID NO: 7) and acceptor site of intron 2, 1 s3e2R (SEQ ID NO: 8) nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice splicing q18e1F (SEQ ID NO: 35) and acceptor site of intron 17b, 2 q18e1R (SEQ ID NO: 36) nucleotides upstream of position 3500, is substituted with T) 4375−2A>G (A in the splice splicing q24e1F (SEQ ID NO: 41) and acceptor site of intron 23, 2 q24e1R (SEQ ID NO: 42) nucleotides upstream of position 4375, is substituted with G) 3172-3174delTAC (TAC at deletion of q17ae1F (SEQ ID NO: 31) and positions 3172 to 3174 are deleted) Y1014 q17ae1R (SEQ ID NO: 32) 2902G>C D924H q15e3F (SEQ ID NO: 29) and q15e4R (SEQ ID NO: 30) 4115T>C I1328T q22e1F (SEQ ID NO: 39) and q22e1R (SEQ ID NO: 40) 4185G>C K1351N q22e1F (SEQ ID NO: 39) and q22e1R (SEQ ID NO: 40) 520C>G L130V q4e1F (SEQ ID NO: 9) and q4e1R (SEQ ID NO: 10) 842A>C Q237P q6ae1F (SEQ ID NO: 13) and q6ae1R (SEQ ID NO: 14) 4528G>T A1466S q24e1F (SEQ ID NO: 41) and q24e1R (SEQ ID NO: 42) 448A>G I106V q4e1F (SEQ ID NO: 9) and q4e1R (SEQ ID NO: 10) 574A>T I148F q4e1F (SEQ ID NO: 9) and q4e1R (SEQ ID NO: 10) 3704T>C M1191T q19e3F (SEQ ID NO: 37) and q19e4R (SEQ ID NO: 38) 1248+5T>C (T in the splice donor q7e3F (SEQ ID NO: 17) and site of intron 7, 5 nucleotides q7e4R (SEQ ID NO: 18) downstream of position 1248, is substituted with C) 296+12T>G (T in intron 2, 12 q2e2F (SEQ ID NO: 5) and nucleotides downstream of q2e2R (SEQ ID NO: 6) position 296, is substituted with G) 3849+3G>A (G in the splice donor q19e3F (SEQ ID NO: 37) and site of intron 19, 3 nucleotides q19e4R (SEQ ID NO: 38) downstream of position 3849, is substituted with A) 497A>G Y122C q4e1F (SEQ ID NO: 9) and q4e1R (SEQ ID NO: 10) −141C>A q-promoter-2-1F (SEQ ID NO: 3) and q-promoter-2-1R (SEQ ID NO: 4) 2875G>C V915L q15e3F (SEQ ID NO: 29) and q15e4R (SEQ ID NO: 30) 2689A>G I853V q14ae5F (SEQ ID NO: 25) and q14ae6R (SEQ ID NO: 26) 3039A>G A969A q15e3F (SEQ ID NO: 29) and q15e4R (SEQ ID NO: 30) 405G>C G91G s3e1F (SEQ ID NO: 7) and s3e2R (SEQ ID NO: 8) 886G>A A252T q6be2F (SEQ ID NO: 15) and q6be2R (SEQ ID NO: 16) 4445G>A R1438Q q24e1F (SEQ ID NO: 41) and q24e1R (SEQ ID NO: 42) −228G>C q-promoter-2-1F (SEQ ID NO: 3) and q-promoter-2-1R (SEQ ID NO: 4) −295C>T q-promoter-1-1F (SEQ ID NO: 1) and q-promoter-1-1R (SEQ ID NO: 2) −379delC (C at position −379 is q-promoter-1-1F (SEQ ID NO: 1) and deleted) q-promoter-1-1R (SEQ ID NO: 2) −540A>G q-promoter-1-1F (SEQ ID NO: 1) and q-promoter-1-1R (SEQ ID NO: 2)

Further information relating to the CF mutations and the CFTR gene are found in FIG. 1. The primers for amplifying segments of the CFTR gene may hybridize to coding or non-coding CFTR sequences under stringent conditions. Preferred primers are those that flank mutant CF sequences.

By “mutations of the CFTR gene” or “mutant CF sequence” is meant one or more CFTR nucleic acid sequences that are associated or correlated with cystic fibrosis. The CF mutations disclosed in Table 1 may be correlated with a carrier state, or with a person afflicted with CF. Thus, the nucleic acid may be tested for any CF mutation described in Table 1. The nucleic acid sequences containing CF mutations are preferably DNA sequences, and are preferably genomic DNA sequences; however, RNA sequences such as mRNA or hnRNA may also contain nucleic acid mutant sequences that are associated with cystic fibrosis.

By “carrier state” is meant a person who contains one CFTR allele that is a mutant CF nucleic acid sequence, but a second allele that is not a mutant CF nucleic acid sequence. CF is an “autosomal recessive” disease, meaning that a mutation produces little or no phenotypic effect when present in a heterozygous condition with a non-disease related allele, but produces a “disease state” when a person is homozygous, i.e., both CFTR alleles are mutant CF nucleic acid sequences.

By “primer” is meant a sequence of nucleic acid, preferably DNA, that hybridizes to a substantially complementary target sequence and is recognized by DNA polymerase to begin DNA replication.

By “substantially complementary” is meant that two sequences hybridize under stringent hybridization conditions. The skilled artisan will understand that substantially complementary sequences need not hybridize along their entire length. In particular, substantially complementary sequences comprise a contiguous sequence of bases that do not hybridize to a target sequence, positioned 3′ or 5′ to a contiguous sequence of bases that hybridize under stringent hybridization conditions to a target sequence.

By “flanking” is meant that a primer hybridizes to a target nucleic acid adjoining a region of interest sought to be amplified on the target. The skilled artisan will understand that preferred primers are pairs of primers that hybridize upstream of a region of interest, one on each strand of a target double stranded DNA molecule, such that nucleotides may be added to the 3′ end of the primer by a suitable DNA polymerase. Primers that flank mutant CF sequences do not actually anneal to the mutant sequence but rather anneal to sequence that adjoins the mutant sequence.

By “isolated” a nucleic acid (e.g., an RNA, DNA or a mixed polymer) is one which is substantially separated from other cellular components which naturally accompany such nucleic acid. The term embraces a nucleic acid sequence which has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates, oligonucleotides, and chemically synthesized analogs or analogs biologically synthesized by heterologous systems.

By “substantially pure” a nucleic acid, represents more than 50% of the nucleic acid in a sample, The nucleic acid sample may exist in solution or as a dry preparation.

By “complement” is meant the complementary sequence to a nucleic acid according to standard Watson Crick pairing rules. A complement sequence can also be a sequence of RNA complementary to the DNA sequence or its complement sequence, and can also be a cDNA.

By “coding sequence” is meant a sequence of a nucleic acid or its complement, or a part thereof, that can be transcribed and/or translated to produce the mRNA for and/or the polypeptide or a fragment thereof. Coding sequences include exons in a genomic DNA or immature primary RNA transcripts, which are joined together by the cell's biochemical machinery to provide a mature mRNA. The anti-sense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.

By “non-coding sequence” is meant a sequence of a nucleic acid or its complement, or a part thereof, that is not transcribed into amino acid in vivo, or where tRNA does not interact to place or attempt to place an amino acid. Non-coding sequences include both intron sequences in genomic DNA or immature primary RNA transcripts, and gene-associated sequences such as promoters, enhancers, silencers, etc.

Nucleic acid suspected of containing mutant CF sequences are amplified using one or more primers that flank the mutations under conditions such that the primers will amplify CFTR fragments containing the mutations, if present. The oligonucleotide sequences in Table 1 are useful for amplifying segments of the CFTR gene which contain the mutations in FIG. 1.

The method of identifying the presence or absence of mutant CF sequence by amplification can be used to determine whether a subject has a genotype containing one or more nucleotide sequences correlated with cystic fibrosis. The presence of a wildtype or mutant sequence at each predetermined location can be ascertained by the invention methods.

By “amplification” is meant one or more methods known in the art for copying a target nucleic acid, thereby increasing the number of copies of a selected nucleic acid sequence. Amplification may be exponential or linear. A target nucleic acid may be either DNA or RNA. The sequences amplified in this manner form an “amplicon.” While the exemplary methods described hereinafter relate to amplification using the polymerase chain reaction (“PCR”), numerous other methods are known in the art for amplification of nucleic acids (e.g., isothermal methods, rolling circle methods, etc.). The skilled artisan will understand that these other methods may be used either in place of, or together with, PCR methods.

The nucleic acid suspected of containing mutant CF sequence may be obtained from a biological sample. By “biological sample” is meant a sample obtained from a biological source. A biological sample can, by way of non-limiting example, consist of or comprise blood, sera, urine, feces, epidermal sample, skin sample, cheek swab, sperm, amniotic fluid, cultured cells, bone marrow sample and/or chorionic villi. Convenient biological samples may be obtained by, for example, scraping cells from the surface of the buccal cavity. The term biological sample includes samples which have been processed to release or otherwise make available a nucleic acid for detection as described herein. For example, a biological sample may include a cDNA that has been obtained by reverse transcription of RNA from cells in a biological sample.

By “subject” is meant a human or any other animal which contains a CFTR gene that can be amplified using the primers and methods described herein. A subject can be a patient, which refers to a human presenting to a medical provider for diagnosis or treatment of a disease. A human includes pre and post natal forms. Particularly preferred subjects are humans being tested for the existence of a CF carrier state or disease state.

By “identifying” with respect to an amplified sample is meant that the presence or absence of a particular nucleic acid amplification product is detected. Numerous methods for detecting the results of a nucleic acid amplification method are known to those of skill in the art.

Specific primers may be used to amplify segments of the CFTR gene that are known to contain mutant CF sequence. By amplifying specific regions of the CFTR gene, the primers facilitate the identification of wildtype or mutant CF sequence at a particular location of the CFTR gene. Primers for amplifying various regions of the CFTR gene include the following: SEQ ID NO 1: (q-promoter-1-1F) TGTAAAACGACGGCCAGTcgtgtcctaagatttctgtg and SEQ ID NO 2: (q-promoter-1-1R) CAGGAAACAGCTATGACCCTTTCCCGATTCTGACTC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 3: (q-promoter-2-1F) TGTAAAACGACGGCCAGTtgccaactggacctaaag and SEQ ID NO 4: (q-promoter-2-1R) CAGGAAACAGCTATGACCCAAACCCAACCCATACAC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 5: (q2e2F) TGTAAAACGACGGCCAGTcataattttccatatgccag and SEQ ID NO 6: (q2e2R) CAGGAAACAGCTATGACCTATGTTTGCTTTCTCTTCTC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 7: (s3e1F) TGTAAAACGACGGCCAGTcttgggttaatctccttgga and SEQ ID NO 8: (s3e2R) CAGGAAACAGCTATGACCATTCACCAGATTTCGTAGTC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 9: (q4e1F) TGTAAAACGACGGCCAGTaaagtcttgtgttgaaattctcagg and SEQ ID NO 10: (q4e1R) CAGGAAACAGCTATGACCCAGCTCACTACCTAATTTATGACAT are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 11: (g5e3F) TGTAAAACGACGGCCAGTacatttatgaacctgagaag and SEQ ID NO 12: (g5e4R) CAGGAAACAGCTATGACCCAGAATAGGGAAGCTAGAG are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 13: (q6ae1F) TGTAAAACGACGGCCAGTggggtggaagatacaatgac and SEQ ID NO 14: (q6ae1R) CAGGAAACAGCTATGACCCATAGAGCAGTCCTGGTTTTAC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 15: (q6be2F) TGTAAAA CGACGGCCAGTaaaataatgcccatctgttg and SEQ ID NO 16: (q6be2R) CAGGAAACAGCTATGACCGTGGAAGTCTACCATGATAAACATA are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 17: (q7e3F) TGTAAAACGACGGCCAGTcttccattccaagatccc and SEQ ID NO 18: (q7e4R) CAGGAAACAGCTATGACCGCAAAGTTCATTAGAACTGATC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 19: (g9e9F) TGTAAAACGACGGCCAGTtggatcatgggccatgtgc and SEQ ID NO 20: (g9e11R) CAGGAAACAGCTATGACCAAAGAGACATGGACACCAAATTAAG are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 21: (q13-1e1F) TGTAAAACGACGGCCAGTcgaggataaatgatttgctcaaag and SEQ ID NO 22: (q13-1e2R) CAGGAAACAGCTATGACCTCGTATAGAGTTGATTGGATTGAGA are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 23: (q13-2e1F) TGTAAAACGACGGCCAGTtcctaactgagaccttacac and SEQ ID NO 24: (q13-2e1R) CAGGAAACAGCTATGACCTTCTGTGGGGTGAAATAC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 25: (q14ae5F) TGTAAAACGACGGCCAGTgtggcatgaaaecgtactgt and SEQ ID NO 26: (q14ae6R) CAGGAAACAGCTATGACCACATCCCCAAACTATCTTAA are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 27: (q14be2F) TGTAAAACGACGGCCAGTatgggaggaataggtgaaga and SEQ ID NO 28: (q14be2R) CAGGAAACAGCTATGACCTGGATTACAATACATACAAACA are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 29: (q15e3F) TGTAAAACGACGGCCAGTggttaagggtgcatgctcttc and SEQ ID NO 30: (q15e4R) CAGGAAACAGCTATGACCGGCCCTATTGATGGTGGATC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 31: (q 17ae1F) TGTAAAACGACGGCCAGTacactttgtccactttgc and SEQ ID NO 32: (q17ae1R) CAGGAAACAGCTATGACCAGATGAGTATCGCACATTC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 33: (q17be1F) TGTAAAACGACGGCCAGTatctattcaaagaatggcac and SEQ ID NO 34: (q17be1R) CAGGAAACAGCTATGACCGATAACCTATAGAATGCAGC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 35: (q18e1F) TGTAAAACGACGGCCAGTtagatgctgtgatgaactg and SEQ ID NO 36: (q18e1R) CAGGAAACAGCTATGACCGAAGGAAAGAAGAGATAAGG are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 37: (q19e3F) TGTAAAACGACGGCCAGTcccgacaaataaccaagtgac and SEQ ID NO 38: (q19e4R) CAGGAAACAGCTATGACCGCTAACACATTGCTTCAGGCTAC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 39: (q22e1F) TGTAAAACGACGGCCAGTctgtcaaggttgtaaatagac and SEQ ID NO 40: (q22e1R) CAGGAAACAGCTATGACCAAGCAGGCATAATGATFC are preferably used together as forward (F) and reverse (R) primers; SEQ ID NO 41: (q24e1F) TGTAAAACGACGGCCAGTtattttcctttgagcctg and SEQ ID NO 42: (q24e1R) CAGGAAACAGCTATGACCGCAGAGGTAACTGTTCCAC are preferably used together as forward (F) and reverse (R) primers. These pairs of primers, which may be used in multiplex amplifications, can amplify the regions of the CFTR gene shown in Table 2.

TABLE 2 CFTR Primer Pairs and Amplicon Characteristics Forward Primer Reverse Primer Exon/Intron Size (in base pairs) q-promoter-1-1F q-promoter-1-1R qp1 553 (SEQ ID NO: 1) (SEQ ID NO: 2) q-promoter-2-1F q-promoter-2-1R qp2 634 (SEQ ID NO: 3) (SEQ ID NO: 4) q2e2F q2e2F exon 2 323 (SEQ ID NO: 5) (SEQ ID NO: 6) s3e1F s3e2R exon 3 345 (SEQ ID NO: 7) (SEQ ID NO: 8) q4e1F q4e1R exon 4 413 (SEQ ID NO: 9) (SEQ ID NO: 10) g5e3F g5e4R intron 5 425 (SEQ ID NO: 11) (SEQ ID NO: 12) q6ae1F q6ae1R exon 6a 334 (SEQ ID NO: 13) (SEQ ID NO: 14) q6be2F q6be2R exon 6b 341 (SEQ ID NO: 15) (SEQ ID NO: 16) q7e3F q7e4R exon 7 431 (SEQ ID NO: 17) (SEQ ID NO: 18) g9e9F g9e11R exon 9 396 (SEQ ID NO: 19) (SEQ ID NO: 20) q13-1e1F q13-1e2R exon 13-1 355 (SEQ ID NO: 21) (SEQ ID NO: 22) q13-2e1F q13-2e1R exon 13-2 584 (SEQ ID NO: 23) (SEQ ID NO: 24) q14ae5F q14ae6R exon 14a 281 (SEQ ID NO: 25) (SEQ ID NO: 26) q14be2F q14be2R exon 14b 223 (SEQ ID NO: 27) (SEQ ID NO: 28) q15e3F q15e4R exon 15 471 (SEQ ID NO: 29) (SEQ ID NO: 30) q17ae1F q17ae1R exon 17a 280 (SEQ ID NO: 31) (SEQ ID NO: 32) q17be1F q17be1R exon 17b 504 (SEQ ID NO: 33) (SEQ ID NO: 34) q18e1F q18e1R exon 18 471 (SEQ ID NO: 35) (SEQ ID NO: 36) q19e3F q19e4R exon 19 489 (SEQ ID NO: 37) (SEQ ID NO: 38) q22e1F q22e1R exon 22 446 (SEQ ID NO: 39) (SEQ ID NO: 40) q24e1F q24e1R exon 24 426 (SEQ ID NO: 41) (SEQ ID NO: 42)

If heterozygous polymorphism or mutation is present in one of the amplicons for exon 6b, the frameshift caused by the polymorphism or mutation will result in unreadable nucleotide sequences. Therefore, if a base change is detected in any one of these amplicons, sequencing should be performed to verify the sequence of another strand using an appropriate primer. This verification sequencing can be performed using the same PCR cleanup product as template. The verification sequencing primer for exon 6b is reflex6be1F (SEQ ID NO: 43): TTGATTGATTGATTGATTGATTT.

The nucleic acid to be amplified may be from a biological sample such as an organism, cell culture, tissue sample, and the like. The biological sample can be from a subject which includes any eukaryotic organism or animal, preferably fungi, invertebrates, insects, arachnids, fish, amphibians, reptiles, birds, marsupials and mammals. A preferred subject is a human, which may be a patient presenting to a medical provider for diagnosis or treatment of a disease. The biological sample may be obtained from a stage of life such as a fetus, young adult, adult, and the like. Particularly preferred subjects are humans being tested for the existence of a CF carrier state or disease state.

The sample to be analyzed may consist of or comprise blood, sera, urine, feces, epidermal sample, skin sample, cheek swab, sperm, amniotic fluid, cultured cells, bone marrow sample and/or chorionic villi, and the like. A biological sample may be processed to release or otherwise make available a nucleic acid for detection as described herein. Such processing may include steps of nucleic acid manipulation, e.g., preparing a cDNA by reverse transcription of RNA from the biological sample. Thus, the nucleic acid to be amplified by the methods of the invention may be DNA or RNA.

Nucleic acid may be amplified by one or more methods known in the art for copying a target nucleic acid, thereby increasing the number of copies of a selected nucleic acid sequence. Amplification may be exponential or linear. The sequences amplified in this manner form an “amplicon.” In a preferred embodiment, the amplification is performed by the polymerase chain reaction (“PCR”) (e.g., Mullis, K. et al., Cold Spring Harbor Symp. Quant. Biol. 51:263-273 (1986); Erlich H. et al., European Patent Application. 50,424; European Patent Application. 84,796, European Patent Application 258,017, European Patent Application. 237,362; Mullis, K., European Patent Application. 201,184; Mullis K. et al., U.S. Pat. No. 4,683,202; Erlich, H., U.S. Pat. No. 4,582,788; and Saiki, R. et al., U.S. Pat. No. 4,683,194). Other known nucleic acid amplification procedures that can be used include, for example, transcription-based amplification systems or isothermal amplification methods (Malek, L. T. et al., U.S. Pat. No. 5,130,238; Davey, C. et al., European Patent Application 329,822; Schuster et al., U.S. Pat. No. 5,169,766; Miller, H. I. et al., PCT application. WO 89/06700; Kwoh, D. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86:1173 (1989); Gingeras, T. R. et al., PCT application WO 88/10315; Walker, G. T. et al., Proc. Natl. Acad. Sci. (U.S.A.) 89:392-396 (1992)). Amplification may be performed with relatively similar levels of each primer of a primer pair to generate a double stranded amplicon. However, asymmetric PCR may be used to amplify predominantly or exclusively a single stranded product as is well known in the art (e.g., Poddar et al. Molec. And Cell. Probes 14:25-32 (2000)). This can be achieved for each pair of primers by reducing the concentration of one primer significantly relative to the other primer of the pair (e.g. 100 fold difference). Amplification by asymmetric PCR is generally linear. One of ordinary skill in the art would know that there are many other useful methods that can be employed to amplify nucleic acid with the invention primers (e.g., isothermal methods, rolling circle methods, etc.), and that such methods may be used either in place of, or together with, PCR methods. Persons of ordinary skill in the art also will readily acknowledge that enzymes and reagents necessary for amplifying nucleic acid sequences through the polymerase chain reaction, and techniques and procedures for performing PCR, are well known. The examples below illustrate a standard protocol for performing PCR and the amplification of nucleic acid sequences that correlate with or are indicative of cystic fibrosis.

In another aspect, the present invention provides methods of detecting a cystic fibrosis genotype in a biological sample. The methods comprise amplifying nucleic acids in a biological sample of the subject and identifying the presence or absence of one or more mutant cystic fibrosis nucleic acid sequences in the amplified nucleic acid. Accordingly, the present invention provides a method of determining the presence or absence of one or more mutant cystic fibrosis nucleic acid sequences in a nucleic acid containing sample, comprising: contacting the sample with reagents suitable for nucleic acid amplification including one or more pairs of nucleic acid primers flanking one or more predetermined nucleic acid sequences that are correlated with cystic fibrosis, amplifying the predetermined nucleic acid sequence(s), if present, to provide an amplified sample; and identifying the presence or absence of mutant or wild type sequences in the amplified sample.

One may analyze the amplified product for the presence of absence of any of a number of mutant CF sequences that may be present in the sample nucleic acid. As already discussed, numerous mutations in the CFTR gene have been associated with CF carrier and disease states. For example, a three base pair deletion leading to the omission of a phenylalanine residue in the gene product has been determined to correspond to the mutations of the CF gene in approximately 50% of Caucasian patients affected by CF. The table below identifies preferred CF sequences and identifies which of the primer pairs of the invention may be used to amplify the sequence.

The CF mutations described herein also may be detected in conjunction with other CF mutations known in the art. Such additional CF mutations include, for example, those known under symbols: 2789+5G>A; 711+1G>T; W1282X; 3120+1G>A; d1507; dF508; (F508C, 1507V, 1506V); N1303K; G542X, G551D, R553X, R560T, 1717−1G>A: R334W, R347P, 1078delT; R117H, 1148T, 621+1G>T; G85E; R1162X, 3659delC; 2184delA; A455E, (5T, 7T, 9T); 3849+10 kbC>T; and 1898+1G>A. Additional CF mutations were disclosed in U.S. application Ser. No. 11/074,903 filed Mar. 7, 2005, such as 605G−>C, 1198-1203del/1204G−>A (deletes TGGGCT and replaces G with A at position 1204), 1484G−>T, 1573A−>G, 1604G−>C, 1641-1642AG−>T, 2949-2953del (deletes TACTC), 2978A−>T, 3239C−>A, and 3429C−>A, which are hereby incorporated by reference in their entirety. Any and all of these mutations can be detected using nucleic acid amplified with the invention primers as described herein or other suitable primers.

CF mutations in the amplified nucleic acid may be identified in any of a variety of ways well known to those of ordinary skill in the art. For example, if an amplification product is of a characteristic size, the product may be detected by examination of an electrophoretic gel for a band at a precise location. In another embodiment, probe molecules that hybridize to the mutant or wild type CF sequences can be used for detecting such sequences in the amplified product by solution phase or, more preferably, solid phase hybridization. Solid phase hybridization can be achieved, for example, by attaching the CF probes to a microchip. Probes for detecting CF mutant sequences are well known in the art.

CF probes for detecting mutations as described herein may be attached to a solid phase in the form of an array as is well known in the art (see, U.S. Pat. Nos. 6,403,320 and 6,406,844). For example, the full complement of 24 probes for CF mutations with additional control probes (30 in total) can be conjugated to a silicon chip essentially as described by Jenison et al., Biosens Bioelectron. 16(9-12): 757-63 (2001) (see also U.S. Pat. Nos. 6,355,429 and 5,955,377). Amplicons that hybridized to particular probes on the chip can be identified by transformation into molecular thin films. This can be achieved by contacting the chip with an anti-biotin antibody or streptavidin conjugated to an enzyme such as horseradish peroxidase. Following binding of the antibody (or streptavidin)-enzyme conjugate to the chip, and washing away excess unbound conjugate, a substrate can be added such as tetramethylbenzidine (TMB) {3,3′,5,5′Tetramethylbenzidine} to achieve localized deposition (at the site of bound antibody) of a chemical precipitate as a thin film on the surface of the chip. Other enzyme/substrate systems that can be used are well known in the art and include, for example, the enzyme alkaline phosphatase and 5-bromo-4-chloro-3-indolyl phosphate as the substrate. The presence of deposited substrate on the chip at the locations in the array where probes are attached can be read by an optical scanner. U.S. Pat. Nos. 6,355,429 and 5,955,377, which are hereby incorporated by reference in their entirety including all charts and drawings, describe preferred devices for performing the methods of the present invention and their preparation, and describes methods for using them.

The binding of amplified nucleic acid to the probes on the solid phase following hybridization may be measured by methods well known in the art including, for example, optical detection methods described in U.S. Pat. No. 6,355,429. In preferred embodiments, an array platform (see, e.g., U.S. Pat. No. 6,288,220) can be used to perform the methods of the present invention, so that multiple mutant DNA sequences can be screened simultaneously. The array is preferably made of silicon, but can be other substances such as glass, metals, or other suitable material, to which one or more capture probes are attached. In preferred embodiments, at least one capture probe for each possible amplified product is attached to an array. Preferably an array contains 10, more preferably 20, even more preferably 30, and most preferably at least 60 different capture probes covalently attached to the array, each capture probe hybridizing to a different CF mutant sequence. Nucleic acid probes useful as positive and negative controls also may be included on the solid phase or used as controls for solution phase hybridization.

Another approach is variously referred to as PCR amplification of specific alleles (PASA) (Sarkar, et al., 1990 Anal. Biochem. 186:64-68), allele-specific amplification (ASA) (Okayama, et al., 1989 J. Lab. Clin. Med. 114:105-113), allele-specific PCR (ASPCR) (Wu, et al., 1989 Proc. Natl. Acad. Sci. USA. 86:2757-2760), and amplification-refractory mutation system (ARMS) (Newton, et al., 1989 Nucleic Acids Res. 17:2503-2516). The method is applicable for single base substitutions as well as micro deletions/insertions. In general, two complementary reactions are used. One contains a primer specific for the normal allele and the other reaction contains a primer for the mutant allele (both have a common 2nd primer). One PCR primer perfectly matches one allelic variant of the target, but is mismatched to the other. The mismatch is located at/near the 3′ end of the primer leading to preferential amplification of the perfectly matched allele. Genotyping is based on whether there is amplification in one or in both reactions. A band in the normal reaction only indicates a normal allele. A band in the mutant reaction only indicates a mutant allele. Bands in both reactions indicate a heterozygote. As used herein, this approach will be referred to as “allele specific amplification.”

In yet another approach, restriction fragment length polymorphism (RFLP), which refers to the digestion pattern when various restriction enzymes are applied to DNA, is used. RFLP analysis can be applied to PCR amplified DNA to identify CF mutations as disclosed herein.

In still another approach, wild type or mutant CF sequence in amplified DNA may be detected by direct sequence analysis of the amplified products. A variety of methods can be used for direct sequence analysis as is well known in the art. See, e.g., The PCR Technique: DNA Sequencing (eds. James Ellingboe and Ulf Gyllensten) Biotechniques Press, 1992; see also “SCAIP” (single condition amplification/internal primer) sequencing, by Flanigan et al. Am J Hum Genet. 2003 April; 72(4):931-9. Epub 2003 Mar. 11. Direct sequencing of CF mutations is also described in Strom et al., 2003 Genetics in Medicine 5(1):9-14.

In yet another approach for detecting wild type or mutant CF sequences in amplified DNA, single nucleotide primer extension or “SNuPE” is used. SNuPE can be performed as described in U.S. Pat. No. 5,888,819 to Goelet et al., U.S. Pat. No. 5,846,710 to Bajaj, Piggee, C. et al. Journal of Chromatogaphy A 781 (1997), p. 367-375 (“Capillary Electrophoresis for the Detection of Known Point Mutations by Single-Nucleotide Primer Extension and Laser-Induced Fluorescence Detection”); Hoogendorn, B. et al., Human Genetics (1999) 104:89-93, (“Genotyping Single Nucleotide Polymorphism by Primer Extension and High Performance Liquid Chromatography”); and U.S. Pat. No. 5,885,775 to Haff et al. (analysis of single nucleotide polymorphism analysis by mass spectrometry).

Another method for detecting CF mutations include the Luminex xMAP system which has been adapted for cystic fibrosis mutation detection by TM Bioscience and is sold commercially as a universal bead array (Tag-It™).

Still another approach for detecting wild type or mutant CF sequences in amplified DNA is the oligonucleotide ligation assay or “OLA” or “OL”. The OLA uses two oligonucleotides which are designed to be capable of hybridizing to abutting sequences of a single strand of a target molecule. One of the oligonucleotides is biotinylated, and the other is detectably labeled. If the precise complementary sequence is found in a target molecule, the oligonucleotides will hybridize such that their termini abut, and create a ligation substrate that can be captured and detected. See e.g., Nickerson et al. (1990) Proc. Natl. Acad. Sci. U.S.A. 87:8923-8927, Landegren, U. et al. (1988) Science 241:1077-1080 and U.S. Pat. No. 4,998,617.

These above approaches for detecting wild type or mutant CF sequence in the amplified nucleic acid is not meant to be limiting, and those of skill in the art will understand that numerous methods are known for determining the presence or absence of a particular nucleic acid amplification product.

In another aspect the present invention provides kits for one of the methods described herein. The kit optionally contain buffers, enzymes, and reagents for amplifying the CFTR nucleic acid via primer-directed amplification. The kit also may include one or more devices for detecting the presence or absence of particular mutant CF sequences in the amplified nucleic acid. Such devices may include one or more probes that hybridize to a mutant CF nucleic acid sequence, which may be attached to a bio-chip device, such as any of those described in U.S. Pat. No. 6,355,429. The bio-chip device optionally has at least one capture probe attached to a surface on the bio-chip that hybridizes to a mutant CF sequence. In preferred embodiments the bio-chip contains multiple probes, and most preferably contains at least one probe for a mutant CF sequence which, if present, would be amplified by a set of flanking primers. For example, if five pairs of flanking primers are used for amplification, the device would contain at least one CF mutant probe for each amplified product, or at least five probes. The kit also preferably contains instructions for using the components of the kit.

The following examples serve to illustrate the present invention. These examples are in no way intended to limit the scope of the invention.

EXAMPLES Example 1 Sample Collection and Preparation

Whole Blood: 5 cc of whole blood is collected in a lavender-top (EDTA) tube or yellow-top (ACD) tube. Green-top (Na Heparin) tubes are acceptable but less desirable. DNA is extracted from blood. 100 ng or more DNA is prepared in TE or sterile water.

Amniotic Fluid: 10-15 cc of Amniotic Fluid is collected in a sterile plastic container.

Cultured Cells: Two T-25 culture flasks with 80-100% confluent growth may be used.

Chorionic Villi: 10-20 mg of Chorionic Villi are collected in a sterile container. 2-3 ml of sterile saline or tissue culture medium is added.

Transport: Whole Blood, Amniotic Fluid, Cultured Cells and Chorionic Villi can be shipped at room temperature (18°-26° C.). Amniotic Fluid, Cultured Cells or Chorionic Villi preferably is used without refrigeration or freezing. Whole Blood and Extracted DNA can be shipped at 2°-10° C.

Storage: Whole Blood, Amniotic Fluid and Extracted DNA are stored at 2°-10° C. Amniotic Fluid is stored at 2°-10° C. only after the aliquot is removed for culturing. Cultured Cells and Chorionic Villi are stored at room temperature (18°-26° C.).

Stability: Whole Blood is generally stable for 8 days at room temperature (180-26° C.) or 8 days refrigerated at 2-10° C. Amniotic Fluid, Cultured Cells, and Chorionic Villi are generally processed to obtain DNA within 24 hours of receipt. Extracted DNA is stable for at least 1 year at 20-10° C.

Example 2 Amplification from DNA

Polymerase chain reaction (PCR) primer pairs were designed using the CFTR gene sequences in EMBL/Genbank (Accession Nos. M55106-M55131). Each PCR primer for the 32 separate PCR reactions contains either an M13 forward linker sequence or an M13 reverse linker sequence as appropriate to allow universal sequence reaction priming. Individual PCR reactions are performed in 96-well microtiter plates under the same conditions for each amplicon. Subsequently, the PCR products are purified with the Millipore Montage™ PCR₉₆ Cleanup kit (Millipore, Bedford, Mass.) on a Beckman BioMek 2000 biorobot. Further details are provided in Strom et al., 2003 Genetics in Medicine 5(1):9-14.

In general, individual amplifications were prepared in a volume of 25 μl, which is added to the 96 well microtiter plates. Each amplification volume contained 2 μl of the nucleic acid sample (generally 10-100 ng of DNA), 19 μl of PCR-Enzyme Mix (PCR mix stock is prepared with 2.5 μl of 10× PCR buffer, 0.5 μl Hot Start Taq (Qiagen Inc., Cat No. 203205), 0.5 μl MgCl₂ (from 25 mM stock), PCR primers, and 0.2 μl of 25 mM dNTP). Master mix contained primers, Qiagen PCR buffer with MgCl₂, bovine serum albumin (BSA) (New England BioLabs, Cat no. B9001B), and dNTPs (Amersham Biosciences, Cat no. 27-2032-01).

The final concentration in the PCR for MgCl₂ was 2.0 mM, for BSA was 0.8 μg/μl, and for each dNTP was 0.2 mM. Primer final concentrations varied from about 1.2 μM to about 0.4 μM.

PCR was conducted using the following temperature profile: step 1: 96° C. for 15 minutes; step 2: 94° C. for 15 seconds; step 3: decrease at 0.5° C./second to 56° C.; step 4: 56° C. for 20 seconds; step 5: increase at 0.3° C./second to 72° C., step 6: 72° C. for 30 seconds; step 7: increase 0.5° C./second up to 94° C.; step 8: repeat steps 2 to 7 thirty three times; step 9: 72° C. for 5 minutes; step 10: 4° C. hold (to stop the reaction).

Example 3 Detection of CF Mutations

The purified PCR products were diluted to approximately 10 ng/μL and cycle sequencing reactions were performed with an ABI Prism Big Dye™ Terminator v3.0 cycle sequencing reaction kit (Applied Biosystems, Foster City, Calif.) according to the manufacturer's protocol. The DNA primers used for the sequencing reaction were M13 forward and reverse primers. Big Dye™ Terminator reaction products were purified by ethanol precipitation and analyzed on an ABI Prism 3100 Genetic Analyzer. Sequences obtained were examined for the presence of mutations by using ABI SeqScape v2.0 software. Both strands of DNA were sequenced.

PCR reactions, purifications, and cycle sequencing reactions were performed in 96-well microtiter plates using biorobots to avoid errors introduced by manual setups. Loading of samples onto the capillary sequencer was also automated. One plate was generally sufficient to perform the entire sequencing reaction for a single patient. Theoretically, if all reactions were successful, the entire sequences for a single patient could be obtained in 24-48 hours after receipt of blood. In practice, however, one or more reactions may need to be repeated because of polymorphisms in intron 8 and 6a or failed reactions.

The contents of the articles, patents, and patent applications, and all other documents and electronically available information mentioned or cited herein, are hereby incorporated by reference in their entirety to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.

Applicants reserve the right to physically incorporate into this application any and all materials and information from any such articles, patents, patent applications, or other physical and electronic documents.

The inventions illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms “comprising”, “including,” containing”, etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the inventions embodied therein herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention.

The invention has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein. Other embodiments are within the following claims. In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group. 

1. A method of determining if a cystic fibrosis transmembrane regulatory (CFTR) gene contains one or more mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G, comprising determining whether CFTR nucleic acid contains one or more of said mutations.
 2. A method of identifying an individual that has one or more mutations in the cystic fibrosis transmembrane regulatory (CFTR) gene comprising determining if nucleic acid from the individual has one or more mutations in one or more CFTR genes, said mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1 G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G.
 3. (canceled)
 4. (canceled)
 5. The method of claim 1 wherein said mutations selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), and 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted).
 6. The method of claim 1 wherein said mutations selected from the group consisting of 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 37Q4T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, and −141C>A.
 7. The method of claim 1 wherein both alleles in the individual are evaluated for said one or more mutations.
 8. The method of claim 1 wherein genomic DNA is tested for the presence of said one or more mutations.
 9. The method of claim 1 wherein a cDNA copy of the CFTR coding sequence is tested for the presence of said one more mutations.
 10. The method of claim 1 wherein sequence from the CFTR gene is amplified by the polymerase chain reaction and the amplified sequence is tested for the presence of said one or more mutations.
 11. The method of claim 1 wherein the presence of said one or more mutations is determined by nucleic acid sequencing.
 12. The method of claim 1 wherein the presence of said one or more mutations is determined by restriction fragment length polymorphism analysis following treatment of CFTR gene sequence with at least one endonuclease restriction enzyme.
 13. The method of claim 1 wherein the presence of said one or more mutations is determined by allele specific amplification.
 14. The method of claim 1 wherein the presence of said one or more mutations is determined by primer extension.
 15. The method of claim 1 wherein the presence of said one or more mutations is determined by oligonucleotide ligation.
 16. The method of claim 1 wherein the presence of said one or more mutations is determined by hybridization with a detectably labeled probe containing the mutant sequence.
 17. The method of claim 1 wherein the presence of said one or more mutations is determined by detecting the mutation in the encoded CFTR protein using an antibody with binding specificity for the mutated CFTR protein.
 18. A substantially purified nucleic acid comprising 8-20 nucleotides fully complementary to a segment of the cystic fibrosis transmembrane regulatory (CFTR) gene that is fully complementary to a portion of the CFTR gene and encompasses a mutant CFTR sequence selected from the group consisting of 3443A>T, 2443delA (A at position 2443 is deleted), 2777insTG (TG are inserted at position 2777), 3123-3125delGTT (GTT at positions 3123-3125 are deleted), 4177delG (G at position 4177 is deleted), 630delG (G at position 630 is deleted), 2068G>T, 1342−2A>G (A in the splice acceptor site of intron 8, 2 nucleotides upstream of position 1342, is substituted with G), 297−1 G>A (G in the splice acceptor site of intron 2, 1 nucleotide upstream of position 297, is substituted with A) 3500−2A>T (A in the splice acceptor site of intron 17b, 2 nucleotides upstream of position 3500, is substituted with T), 4375−2A>G (A in the splice acceptor site of intron 23, 2 nucleotides upstream of position 4375, is substituted with G), 3172-3174delTAC (TAC at positions 3172 to 3174 are deleted), 2902G>C, 4115T>C, 4185G>C, 520C>G, 842A>C, 4528G>T, 448A>G, 574A>T, 3704T>C, 1248+5T>C (T in the splice donor site of intron 7, 5 nucleotides downstream of position 1248, is substituted with C), 296+12T>G (T in intron 2, 12 nucleotides downstream of position 296, is substituted with G), 3849+3G>A (G in the splice donor site of intron 19, 3 nucleotides downstream of position 3849, is substituted with A), 497A>G, −141C>A, 2875G>C, 2689A>G, 3039A>G, 405G>C, 886G>A, 4445G>A, −228G>C, −295C>T, −379delC (C at position −379 is deleted), and −540A>G, or a complementary nucleic acid sequence thereof, wherein the purified nucleic acid is no more than 50 nucleotides in length.
 19. The substantially purified nucleic acid of claim 18 wherein said nucleic acid is labeled with a detectable label.
 20. The substantially purified nucleic acid of claim 19 wherein said detectable label is selected from the group consisting of a radioisotope, a dye, a fluorescent molecule, a hapten and a biotin molecule. 